1. SUMMARY

A robust test, which we call an aligned generalized M-test, is developed for testing
subhypotheses in the general linear model, and its asymptotic properties are studied.
The test is a robustification of the well-known F-test, and it is an elegant and
practical alternative to Ronchetti's (1982) class of τ-tests. Unlike Ronchetti's test,
its p-values can be approximated readily from existing chi square tables. The test is
based on an appropriately constructed quadratic form, and uses the generalized
M-estimators of the parameters in the reduced model. Under the null hypothesis the
asymptotic distribution is a central chi square, and under contiguous alternatives it
is a noncentral chi square with the same degrees of freedom. The test can also be
viewed as a generalization of Sen's (1982) M-test for linear models.

The influence function of the test is bounded. The bound applies not only to the
influence of residuals but also to the influence of position in the factor space.
Sen's test, on the other hand, has bounded influence only in residuals.

Some key words: aligned generalized M-test; bounded influence; contiguous
alternatives; influence function; linear model; random carriers; robustness; τ-test.


2.  INTRODUCTION  AND  THE  MODEL 

We consider the following model: $\{(x_i, y_i);\ i = 1, 2, \ldots, n\}$ are independent
random variables such that

$$y_i = x_i^T \beta + \varepsilon_i \tag{2.1}$$

where $x_i$ is independent of $\varepsilon_i$ and has distribution function $K(x)$ with
density $k(x)$. In the above linear model it is well known that least squares estimates
are very sensitive to aberrant data points, that is, points that deviate significantly
from the bulk of the data. This sensitivity has led to various proposals for robust
methods of estimation. Among those proposals are the classical M-estimates
introduced by Huber (1973), and the generalized M-estimates introduced by
Hampel (1977) and discussed by Maronna and Yohai (1981).

Parameter estimation is the first step in data analysis. Often, we are interested in
testing whether a number of linearly independent estimable functions are equal to
zero. Through a transformation in the parameter space this hypothesis reduces to the
hypothesis that a certain subvector of the vector of unknown parameters equals zero,
treating the remaining parameters as nuisance parameters. Known robust testing
procedures are the ρ-test, introduced by Schrader and Hettmansperger (1980), which
makes use of Huber's M-estimators, and the alternative to the ρ-test, an aligned
M-test, which was introduced by Sen (1982).

Let $\beta^T = (\beta_1^T, \beta_2^T)$, where $\beta$ is a $p \times 1$ vector and
$\beta_2$ is a $q \times 1$ vector, $q < p$. The above testing procedures, for the
hypothesis $H_0: \beta_2 = 0$, $\beta_1$ unspecified, are robust against points that
exhibit large residuals. The influence with respect to the residuals of the above
testing procedures is bounded. Their total influence, though, is unbounded, since the
part of it that corresponds to the influence in the factor space is unbounded (see
Hampel et al., 1986, page 354).

Ronchetti (1982) introduced the class of τ-tests, which makes use of the
generalized M-estimates, which have bounded influence. The class of τ-tests, for
testing the hypothesis $H_0: \beta_2 = 0$, $\beta_1$ unspecified, is defined by means
of the test statistic

$$S_n^2(x_1, \ldots, x_n; y_1, \ldots, y_n) := \frac{2}{q} \sum_{i=1}^{n}
\left[ \tau\left(x_i; y_i - x_i^T (T_\omega)_n\right)
     - \tau\left(x_i; y_i - x_i^T (T_\Omega)_n\right) \right] \tag{2.2}$$

where $(T_\omega)_n$, $(T_\Omega)_n$ are the generalized M-estimates in the reduced and
full model respectively, and $\tau: R^p \times R \to R^+$ is a function such that
(i) for all $x \in R^p$, $r \in R$: $\tau(x; r) \not\equiv 0$, $\tau(x; r) \ge 0$,
$\tau(x; 0) = 0$, and (ii) for all $x \in R^p$, $\tau(x; \cdot)$ is differentiable.
See Hampel et al. (1986, p. 345) for a complete set of regularity conditions. Large
values of $S_n^2$ are significant.
The asymptotic distribution of Ronchetti's (1982) τ-test statistic, under $H_0$, is
given by the distribution of

$$\sum_{j=p-q+1}^{p} \lambda_j N_j^2 \tag{2.3}$$

where $\lambda_{p-q+1} \ge \cdots \ge \lambda_p > 0$ are the $q$ positive eigenvalues
of the matrix

$$K = Q \left[ M^{-1} - \begin{pmatrix} M_{11}^{-1} & 0 \\ 0 & 0 \end{pmatrix} \right] \tag{2.4}$$

where $M_{11}^{-1}$ is the inverse of the upper $(p-q) \times (p-q)$ part of the $M$
matrix, and

$$M = E\{\eta'(x; r)\, x x^T\}, \qquad Q = E\{\eta^2(x; r)\, x x^T\} \tag{2.5}$$

where

$$\frac{\partial}{\partial r}\tau(x; r) = \eta(x; r), \qquad
  \frac{\partial}{\partial r}\eta(x; r) = \eta'(x; r). \tag{2.6}$$

Further, $N_j$, $j = p-q+1, \ldots, p$, are independent standard normal random
variables.
The most important choices of the η-function are of the form

$$\eta(x; r) = v(x)\, \psi_c\!\left( \frac{r}{\sigma\, v(x)} \right) \tag{2.7}$$

where $v: R^p \to R^+$ is a weight function, $\sigma > 0$ and $r$ is the residual, and
$\psi_c(t) = \max[-c, \min(c, t)]$, the Huber ψ-function.
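The following is a minimal sketch of such an η-function, not the authors' code. It assumes that (2.7) has the Schweppe-type form η(x; r) = v(x) ψ_c(r / (σ v(x))); the function names `huber_psi` and `eta_schweppe` are hypothetical and chosen only for illustration.

```python
def huber_psi(t, c=1.345):
    """Huber's psi-function: psi_c(t) = max(-c, min(c, t))."""
    return max(-c, min(c, t))


def eta_schweppe(v, r, c=1.345, sigma=1.0):
    """Assumed form of (2.7): eta(x; r) = v(x) * psi_c(r / (sigma * v(x))).

    v is the value of the weight function v(x) > 0 at the design point,
    r is the residual and sigma is the scale parameter.
    """
    return v * huber_psi(r / (sigma * v), c)


# A small residual at a well-behaved point passes through unchanged.
small = eta_schweppe(v=1.0, r=0.5)
# A large residual at a heavily downweighted point is capped at c * v(x).
capped = eta_schweppe(v=0.2, r=10.0)
```

Because |ψ_c| ≤ c, the value of η can never exceed c·v(x) in absolute value, which is why a weight v(x) that shrinks at remote design points limits their contribution.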

The τ-test can be viewed as a complete robustification of the well-known F-test,
since it accommodates points with large residuals and points with high leverage. It
is impractical, however, since it is difficult to calculate the p-values associated
with it.

We will develop an alternative test, which can be viewed as a practical alternative
to the class of τ-tests. We call it an aligned generalized M-test, and it is based on
a properly constructed quadratic form which makes use of the generalized
M-estimates. The bounded influence property of the generalized M-estimators
carries over to the test based on them. Thus the influence function of the aligned
generalized M-test is bounded.

The  asymptotic  distribution  of  the  aligned  generalized  M-test,  under  the 
null  hypothesis,  is  that  of  a  central  chi  square  random  variable  with  q  degrees  of 
freedom.  Therefore  p-values  associated  with  the  test  statistic  can  be  approximated 
readily  from  existing  chi  square  tables.  The  asymptotic  distribution  of  the  test, 
under  contiguous  alternatives,  is  that  of  a  noncentral  chi  square  random  variable. 


3. THE PROPOSED TEST, ASSUMPTIONS AND DISTRIBUTION THEORY

The model is that of section 2. We will assume the regularity conditions
(A1) - (A6) and (C1) - (C6) listed in Maronna and Yohai (1981). The notation there is
slightly different, and we will not reproduce all the conditions here.

In addition, we assume that the density of the residuals is bounded. Without loss of
generality, assume that $\sigma = 1$. In practice $\sigma$ has to be estimated from the
data. We will discuss the estimation of the scale parameter $\sigma$ in a later section.

Estimators based on (2.7) have been studied by Hampel (1977) and Krasker
(1980) for the special case of $v(x) = 1/\|x\|$, where $\|x\|$ is the Euclidean norm of
$x$, and for the general case by Maronna and Yohai (1981).

We will use generalized M-estimators, defined implicitly by the (vector)
equation

$$\sum_{i=1}^{n} \eta(x_i; y_i - x_i^T \beta)\, x_i = 0 \tag{3.1}$$

Define the dispersion function

$$D(\beta) = \sum_{i=1}^{n} \tau(x_i; y_i - x_i^T \beta). \tag{3.2}$$

Its gradient, with respect to $\beta$, is

$$\nabla D(\beta) = -\sum_{i=1}^{n} \eta(x_i; y_i - x_i^T \beta)\, x_i. \tag{3.3}$$

Let

$$S(\beta) = -\nabla D(\beta) = \sum_{i=1}^{n} \eta(x_i; y_i - x_i^T \beta)\, x_i. \tag{3.4}$$

Let $x_i^T = (x_i^{(1)T}, x_i^{(2)T})$, where $x_i^{(2)}$ is a q-dimensional vector, be
the partition corresponding to the partition of the vector of unknown parameters. To
test the hypothesis $H_0: \beta_2 = 0$, $\beta_1$ unspecified versus
$H_1: \beta_2 \ne 0$, $\beta_1$ unspecified, define the statistic as follows:

$$W_n^2 = n^{-1} S_2^T(\hat\beta_0)\, \hat U^{-1} S_2(\hat\beta_0) \tag{3.5}$$

where $\hat\beta_0$ is the reduced model generalized M-estimator, $S_2$ is given in
(3.10), and $\hat U$ is a consistent estimate of the asymptotic variance-covariance
matrix $U$, which is defined in Theorem 1.

Large values of $W_n^2$ are significant. The test statistic $W_n^2$ defines an
aligned test, which we call an aligned generalized M-test. We will discuss the
distribution theory of the test statistic $W_n^2$. Define

$$L(\beta) = S(\beta_0) - n \hat M (\beta - \beta_0) \tag{3.6}$$

where $\beta_0$ is the true parameter and

$$\hat M = n^{-1} \sum_{i=1}^{n} \eta'(x_i; y_i - x_i^T \beta_0)\, x_i x_i^T \tag{3.7}$$

is a consistent estimator of the matrix $M$ defined in (2.5), and $\eta'$ is defined in
(2.6). Assuming the regularity conditions of Maronna and Yohai (1981), the
following results hold. Proofs are given in Markatou (1988).

Lemma 1

Given $K > 0$,

$$\sup_{n^{1/2}\|\beta - \beta_0\| \le K}
  \left\| n^{-1/2} \left[ S(\beta) - L(\beta) \right] \right\| \tag{3.8}$$

converges in probability to zero.

Lemma 2

Consider the random variables $\eta(x_i; y_i - x_i^T \beta_0)\, x_{ij}$, $i = 1, \ldots, n$
and $j = 1, 2, \ldots, p$, where the $(x_i, y_i)$ are independent and identically
distributed random vectors. Then, under the assumptions of finite variance and zero
mean,

$$n^{-1/2} S(\beta_0) = n^{-1/2} \sum_{i=1}^{n} \eta(x_i; \varepsilon_i)\, x_i$$

converges in distribution to a random variable $Z$, where $Z \sim MVN(0, Q)$ with $Q$
defined in (2.5).

Lemma 3

$n^{1/2}(\hat\beta - \beta_0)$ converges in distribution to the random variable $Z$,
where $Z \sim MVN(0, M^{-1} Q M^{-1})$, with $M$ and $Q$ given in (2.5), and
$\hat\beta$ is the generalized M-estimator in the full model.

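Theorem 1

Under the null hypothesis

$$H_0: \beta_2 = 0, \quad \beta_1 \text{ unspecified},$$

the test statistic $W_n^2 = n^{-1} S_2^T(\hat\beta_0)\, \hat U^{-1} S_2(\hat\beta_0)$ has
asymptotically a chi square distribution with $q$ degrees of freedom, where

$$U = Q_{22} - M_{21} M_{11}^{-1} Q_{12} - Q_{21} M_{11}^{-1} M_{12}
      + M_{21} M_{11}^{-1} Q_{11} M_{11}^{-1} M_{12}$$

and

$$S_2(\hat\beta_0) = \sum_{i=1}^{n} \eta(x_i; y_i - x_i^T \hat\beta_0)\, x_i^{(2)} \tag{3.10}$$

and $\hat\beta_0$ is the generalized M-estimator in the reduced model (its last $q$
components are zero).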
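The form of the matrix U can be motivated by a short heuristic calculation. The sketch below is not the proof in Markatou (1988); it assumes the asymptotic linearity of S around the true parameter (as in Lemma 1), the limit law of n^{-1/2} S at the true parameter (as in Lemma 2), and symmetry of M and Q. The symbol β̂_{01} for the first p − q components of the reduced model estimator is introduced here only for illustration.

```latex
% Heuristic sketch under H_0 (assumptions: asymptotic linearity, M and Q symmetric).
\begin{align*}
% The reduced-model estimator solves S_1(\hat\beta_0) = 0, so linearity gives
n^{1/2}(\hat\beta_{01} - \beta_1) &\approx M_{11}^{-1}\, n^{-1/2} S_1(\beta_0), \\
% and the aligned score for the tested block becomes
n^{-1/2} S_2(\hat\beta_0) &\approx n^{-1/2}\left[ S_2(\beta_0) - M_{21} M_{11}^{-1} S_1(\beta_0) \right]
  = \begin{pmatrix} -M_{21} M_{11}^{-1} & I \end{pmatrix} n^{-1/2} S(\beta_0). \\
% Since n^{-1/2} S(\beta_0) is asymptotically MVN(0, Q), the limiting covariance is
U &= \begin{pmatrix} -M_{21} M_{11}^{-1} & I \end{pmatrix} Q
     \begin{pmatrix} -M_{11}^{-1} M_{12} \\ I \end{pmatrix} \\
  &= Q_{22} - M_{21} M_{11}^{-1} Q_{12} - Q_{21} M_{11}^{-1} M_{12}
     + M_{21} M_{11}^{-1} Q_{11} M_{11}^{-1} M_{12}.
\end{align*}
```

A quadratic form in an asymptotically normal q-vector, standardized by a consistent estimate of its covariance, is asymptotically chi square with q degrees of freedom, which is the statement of the theorem.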


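We use contiguity as it is defined by Hajek and Sidak (1967, page 202). We
regard the given testing problem as a member of a sequence $\{H_{0n}, H_{1n}\}$,
$n \ge 1$, of testing problems, and we consider alternatives

$$H_{1n}: \beta_j = n^{-1/2} \Delta_j, \quad j = p-q+1, \ldots, p,$$

where $\Delta = (\Delta_{p-q+1}, \ldots, \Delta_p)^T$ is a $q \times 1$ vector of finite
constants. Then:

$$H_{0n}: \beta_n = \begin{pmatrix} \beta_1 \\ 0 \end{pmatrix}, \qquad
  H_{1n}: \beta_n = \begin{pmatrix} \beta_1 \\ n^{-1/2} \Delta \end{pmatrix}$$

Theorem 2

Under the contiguous alternatives $H_{1n}$, the test statistic
$W_n^2 = n^{-1} S_2^T(\hat\beta_0)\, \hat U^{-1} S_2(\hat\beta_0)$ has asymptotically a
noncentral chi square distribution with $q$ degrees of freedom, where $\delta$ is the
noncentrality parameter defined by

$$\delta = \Delta^T M_{22.1}\, U^{-1} M_{22.1}\, \Delta \tag{3.11}$$

with $M_{22.1} = M_{22} - M_{21} M_{11}^{-1} M_{12}$.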

Sen (1982) introduced the aligned M-test in linear models, based on the classical
M-estimators. Note that if $\eta(x; r) = \psi_c(r)$, where
$\psi_c(r) = \max\{-c, \min(c, r)\}$ is Huber's ψ-function and $r$ denotes a residual,
the above aligned generalized M-test reduces to Sen's M-test. Therefore, Sen's M-test
is a special case of the aligned generalized M-test. The noncentrality parameter
$\delta$, in the case of simple linear regression and with $\eta(x; r) = \psi_c(r)$,
is given as

$$\delta = \frac{\Delta^2 \gamma^2 \sigma_x^2}{\sigma_\psi^2} \tag{3.13}$$

where $\Delta = \Delta_2$, $\gamma = E[\psi_c'(r)]$, $\sigma_\psi^2 = E[\psi_c^2(r)]$ and
$\sigma_x^2 = \lim n^{-1} \sum_{i=1}^{n} (x_i - \bar x)^2$, that is, Sen's
noncentrality parameter.
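The reduction to Sen's noncentrality parameter can be checked with a short calculation. This is a sketch under stated assumptions, not a derivation taken from the source: it assumes the noncentrality parameter has the form δ = Δ^T M_{22.1} U^{-1} M_{22.1} Δ with M_{22.1} = M_{22} − M_{21} M_{11}^{-1} M_{12}, that the carriers and the errors are independent, and it introduces the labels Σ = E[x x^T] and σ_ψ² = E[ψ_c²(r)] purely for illustration.

```latex
% Assumptions: eta(x; r) = psi_c(r), x independent of the error,
% Sigma = E[x x^T], gamma = E[psi_c'(r)], sigma_psi^2 = E[psi_c^2(r)].
\begin{align*}
M &= \gamma\, \Sigma, \qquad Q = \sigma_\psi^2\, \Sigma, \\
U &= \sigma_\psi^2 \left( \Sigma_{22} - \Sigma_{21} \Sigma_{11}^{-1} \Sigma_{12} \right)
   = \sigma_\psi^2\, \Sigma_{22.1}, \\
M_{22.1} &= \gamma\, \Sigma_{22.1}, \\
\delta &= \Delta^T M_{22.1}\, U^{-1} M_{22.1}\, \Delta
        = \frac{\gamma^2}{\sigma_\psi^2}\, \Delta^T \Sigma_{22.1}\, \Delta .
\end{align*}
% Simple linear regression with an intercept: Sigma_{22.1} = E[x^2] - (E[x])^2 = sigma_x^2,
% so delta = Delta^2 gamma^2 sigma_x^2 / sigma_psi^2.
```

The three cross terms in U collapse because M and Q are proportional to the same matrix Σ in this special case.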


4. INFLUENCE FUNCTION

We  calculate  the  influence  function  of  the  aligned  generalized  M-test  and 
show  that  it  is  bounded.  Roughly  speaking,  the  influence  function  formalizes  the 
bias  caused  by  one  outlier.  It  was  introduced  by  Hampel  (1968, 1974)  and  it 
describes  the  effect  of  an  infinitesimal  contamination  at  a  point  on  the  estimate, 
standardized  by  the  mass  of  the  contamination. 

Its formal definition is as follows:

$$IF(x; T, F) = \lim_{t \downarrow 0} \frac{T((1-t)F + t\Delta_x) - T(F)}{t}$$

for those $x \in \mathcal{X}$ where the limit exists. $T$ is an estimator which is a
functional, or asymptotically can be replaced by a functional, $F$ is a distribution
function and $\Delta_x$ is the distribution that concentrates all the mass at $x$.
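To make the definition concrete, the sketch below approximates the influence function numerically for a simple functional on a discrete distribution, using a small finite t instead of the limit. The helper names `mean_functional` and `influence` and the four-point sample are hypothetical, chosen only for illustration.

```python
def mean_functional(points, weights):
    """Mean of a discrete distribution given support points and probabilities."""
    return sum(p * w for p, w in zip(points, weights))


def influence(functional, points, weights, x, t=1e-6):
    """Finite-t approximation of [T((1-t)F + t*delta_x) - T(F)] / t."""
    base = functional(points, weights)
    mixed_points = list(points) + [x]
    mixed_weights = [(1.0 - t) * w for w in weights] + [t]
    return (functional(mixed_points, mixed_weights) - base) / t


# Hypothetical discrete distribution with mean 0.5.
sample = [-1.0, 0.0, 0.5, 2.5]
probs = [0.25, 0.25, 0.25, 0.25]

# For the mean, the influence function is x - mean, which grows without bound in x.
if_at_10 = influence(mean_functional, sample, probs, x=10.0)
if_at_1000 = influence(mean_functional, sample, probs, x=1000.0)
```

The mean is the simplest example of a functional with unbounded influence; a bounded-influence procedure is one for which this quantity stays below a finite constant for every contamination point.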

The influence function of a test statistic describes the effect of an outlier in
the sample on the value of the (standardized) test statistic, and therefore on the
decision (acceptance or rejection of $H_0$) which is based on this value. It is
analogous to the influence function of the estimators on which the test is based.

The influence function of the aligned generalized M-test is calculated under
the null hypothesis $H_0: \beta_2 = 0$, $\beta_1$ unspecified. It is proved that, under
certain conditions on the η-function, the influence function is bounded. Let
$F_\beta(x, y)$ denote the joint distribution function of $(x, y)$.

Note that the model distribution under $H_0$ is $F_{\beta_0}$. Define:

$$J(\beta) = \int \eta(x; y - x^T \beta)\, x \, dF_{\beta_0}(x, y) \tag{4.1}$$


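Theorem 3

Let $W_n$ be the aligned generalized M-test, and assume the regularity
conditions of Maronna and Yohai (1981) are satisfied. Assume $J(\beta_0) = 0$ (Fisher
consistency). Then the influence function of the test statistic at $(x_0, y_0)$ is
given as:

$$IF(x_0, y_0; W, F_{\beta_0}) = \left| \eta(x_0; y_0 - x_0^T \beta_0) \right|
  \left\{ x_0^T \begin{pmatrix} -M_{11}^{-1} M_{12} \\ I \end{pmatrix} U^{-1}
  \begin{pmatrix} -M_{21} M_{11}^{-1} & I \end{pmatrix} x_0 \right\}^{1/2}$$

The proof of Theorem 3 is similar to the proof of Proposition 1 in Hampel et al.
(1986, p. 350) and hence we do not repeat it here; see also Markatou (1988).

The most important choices for the function $\eta(x; r)$ are of the form given by
(2.7). Since $|\psi_c| \le c$, such a function satisfies $|\eta(x_0; r)| \le c\, v(x_0)$,
so in order for the influence function to be bounded $v^2(x_0)\, x_0^T A x_0$ must be
bounded, where

$$A = \begin{pmatrix} -M_{11}^{-1} M_{12} \\ I \end{pmatrix} U^{-1}
      \begin{pmatrix} -M_{21} M_{11}^{-1} & I \end{pmatrix}. \tag{4.2}$$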

Note that the matrix $A$ does not depend on the outlying case $(x_0, y_0)$. Thus a
natural choice for the η-function would be the one for which

$$v(x_0) = \left\{ x_0^T \begin{pmatrix} -M_{11}^{-1} M_{12} \\ I \end{pmatrix} U^{-1}
  \begin{pmatrix} -M_{21} M_{11}^{-1} & I \end{pmatrix} x_0 \right\}^{-1/2} \tag{4.3}$$

In that case

$$\sup_{x, y} \left| IF(x, y; W, F_{\beta_0}) \right| \le c < \infty \tag{4.4}$$

where $c$ is a finite constant. See, for example, Krasker and Welsch (1982).

There are other choices of $v(x)$ for which the influence function of the test
statistic remains bounded.

(1) Following Schweppe's proposal introduced in Handschin et al. (1975), we
choose $v(x_0) = (1 - x_0^T (X^T X)^{-1} x_0)^{1/2} = (1 - h_0)^{1/2}$. Then, as
$h_0 \to 1$, $v(x_0) \to 0$, where $h_0$ is the leverage that corresponds to the
outlying case $(x_0, y_0)$. Then the sensitivity of the test statistic $W$ is

$$\sup_{x, y} \left| IF(x, y; W, F_{\beta_0}) \right|
  \le \sup_{x, y} \left\| (-M_{21} M_{11}^{-1})\, IF(x, y; \hat\beta_0, F_{\beta_0}) \right\|
  \le c_1 < \infty$$

that is, the sensitivity of the test is bounded by a rescaled version of the sensitivity
of the reduced model generalized M-estimator.

(2) Welsch's (1980) proposal consists of choosing $v(x_0) = (1 - h_0)/h_0^{1/2}$.
Then, as $h_0 \to 1$, $v(x_0) \to 0$ and thus the influence function of the test
statistic reduces to a rescaled version of the influence function of the reduced model
generalized M-estimate, which has bounded sensitivity.

Generally, starting with a bounded influence estimator, the influence
function of the test based on this estimator will be bounded.


5.  EXAMPLE 

The  data  design  comes  from  a  paper  by  Hill  and  Holland  (1977)  and  consists 
of  six  columns.  Table  5.1  contains  the  values  of  the  explanatory  variables  in  its  first 
six  columns.  Column  number  7  corresponds  to  the  data  values  of  the  dependent 
variable  y. 


Table 5.1: Data Values

Row \ Column       1        2        3        4        5        6          7

     1           .2712    .2712   -.0453    .0257   -.0880    .0288     .92718
     2           .2712    .1627    .1092   -.1268   -.0509    .0470    -.06165
     3           .2712    .0542    .4513    .0963    .0140    .0682    -.74198
     4           .2712   -.0542   -.1605    .2977   -.1065    .0225    -.31325
     5           .2712   -.1627    .2242   -.3618    .2463    .3193    -.18593
     6           .2712   -.2712    .0107    .1246   -.0814    .0461     .18593
     7           .1627   -.2712    .1937    .1006   -.0373    .0583     .31325
     8           .0542   -.2712   -.2435    .3205   -.1373    .0404   -1.40377
     9          -.0542   -.2712   -.0094   -.4123   -.0852    .0228    1.12690
    10          -.1627   -.2712    .1382    .4631   -.0630   -.0112    1.87129
    11          -.2712   -.2712    .0956    .0984   -.0489    .0388    -.91718
    12          -.2712   -.1627    .0597   -.1136   -.0732    .0327     .74198
    13          -.2712   -.0542   -.0613   -.1263   -.0944    .0303     .06165
    14          -.2712    .0542    .1282    .0598   -.0680    .0691   -1.87129
    15          -.2712    .1627   -.0966   -.0085    .1387   -.0672    -.58740
    16          -.2712    .2712   -.1060   -.3819   -.1340    .0559    -.44602
    17          -.1627    .2712    .2013    .0145   -.0290    .0966    1.40377
    18           .0542    .2712    .0914    .0840   -.0417   -.0620     .58740
    19          -.0542    .2712   -.4324   -.2083   -.1520   -.9198   -1.12690
    20           .1627    .2712   -.5486    .0544    .8917    .0833     .44602


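The model to be fitted is

$$y_i = \beta_0 + \beta_1 x_{i1} + \beta_2 x_{i2} + \beta_3 x_{i3} + \beta_4 x_{i4}
        + \beta_5 x_{i5} + \beta_6 x_{i6} + \varepsilon_i, \quad i = 1, \ldots, 20. \tag{5.1}$$

In vector formulation the above model can be written as:

$$y = X \begin{pmatrix} \boldsymbol\beta_1 \\ \boldsymbol\beta_2 \end{pmatrix}
      + \varepsilon \tag{5.2}$$

where $\boldsymbol\beta_1^T = (\beta_0, \beta_1, \beta_2)$ and
$\boldsymbol\beta_2^T = (\beta_3, \beta_4, \beta_5, \beta_6)$ are the two subvectors of
the parameter vector. We are interested in testing the hypothesis

$H_0: \boldsymbol\beta_2 = 0$, $\boldsymbol\beta_1$ unspecified vs
$H_1: \boldsymbol\beta_2 \ne 0$, $\boldsymbol\beta_1$ unspecified.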

In  an  analysis  of  variance  setting  we  can  interpret  the  above  hypothesis  as  testing 
for  the  significance  of  the  covariates  in  a  designed  experiment. 

The  reason  for  choosing  this  particular  data  set  is  that  the  observations 
exhibit  varying  degrees  of  leverage.  The  first  two  columns  of  the  data  set 
correspond  to  variables  like  those  in  a  designed  experiment,  and  hence  they 
represent  a  very  well  behaved  low  leverage  situation.  The  next  two  columns  were 
selected  to  represent  a  sample  from  a  bivariate  normal  distribution,  and  they  are 
also  low  leverage  observations. 

Columns  5  and  6  were  chosen  to  represent  a  sample  of  20  observations  from  a 


distribution  with  outliers.  Two  independent  Cauchy  samples  of  size  20  were  drawn 
and  the  largest  observations  in  each  sample  were  moved  until  they  contributed  80% 
and  85%  to  the  total  sum  of  squares  of  their  columns,  respectively.  Thus,  those 
two  columns  represent  a  high  leverage  situation.  The  two  high  leverage  cases  are 
located  in  rows  19  and  20. 

Note that the leverage of the nineteenth case is $h_{19} = .952986$ and that of
the twentieth case is $h_{20} = .747831$. If $h_i > 2(p/n)$, where $p/n$ is the average
leverage, $p$ is the number of parameters and $n$ is the sample size, then we
characterize the ith observation as a high leverage observation; see Belsley et al.
(1980). In this particular example $2(p/n) = .7$, and so clearly observations 19 and
20 are high leverage ones.
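The leverage rule can be sketched in a few lines of plain Python. This is an illustrative sketch, not the computation used in the paper: the helpers `invert`, `leverages` and `high_leverage` are hypothetical names, and the five-point design `toy_X` is made-up data rather than the design of Table 5.1.

```python
def invert(a):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular to working precision")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def leverages(X):
    """Diagonal of the hat matrix: h_i = x_i^T (X^T X)^{-1} x_i."""
    p = len(X[0])
    xtx = [[sum(row[j] * row[k] for row in X) for k in range(p)] for j in range(p)]
    inv = invert(xtx)
    return [sum(row[j] * inv[j][k] * row[k] for j in range(p) for k in range(p))
            for row in X]


def high_leverage(X):
    """Indices of the rows whose leverage exceeds the cutoff 2p/n."""
    n, p = len(X), len(X[0])
    cutoff = 2.0 * p / n
    return [i for i, h in enumerate(leverages(X)) if h > cutoff]


# Made-up design: intercept plus one carrier with a single remote point.
toy_X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0], [1.0, 20.0]]
flagged = high_leverage(toy_X)

# Cutoff for the example in the text: p = 7 parameters, n = 20 cases.
paper_cutoff = 2.0 * 7 / 20
```

The leverages always sum to p, so the average is p/n and the rule flags cases with more than twice the average leverage.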

After  the  above  six  columns  were  selected,  each  column  was  standardized  to 
have  mean  0  and  unit  sum  of  squares. 

To generate a set of dependent variables y (column 7 in Table 5.1), we
generated 20 normal scores. Regressing the obtained set of normal scores onto the
six columns generated before, we selected the random permutation of the normal
scores that gives us a small $R^2$ coefficient. Therefore the experiment has been
constructed so that no regression effect is present.

We would like to show how a combination of high leverage cases with
outliers in the dependent variable distorts the results of an analysis based not only
on the classical F-test procedure, but also on robust procedures that do not
accommodate high leverage cases. To this end we replace the dependent value that
corresponds to the nineteenth case by $y_{19} = -7.84$. This new value is
approximately six standard deviations away from the original value. The least
squares regression with $y_{19} = -7.84$ gives $R^2 = 72.6\%$ and an adjusted
$R^2 = 60\%$. Table 5.2 contains the values of the various statistics and their
p-values:


Table 5.2: Statistics and their p-values

Statistic                        Numerical Value     P-value

F-statistic                      7.9307475           .0008
M-test                           13.86               .008
aligned generalized M-test       3.69264             .4492

To calculate the M-test we used Huber's ψ-function with c = 1.345 (which gives
95% efficiency at the normal model). Hence, in this example, the bounded influence
aligned generalized M-test outperformed both the F-test and the M-test.

To calculate the aligned generalized M-test we chose to use Welsch's
weights; that is, the weight function $v(x)$ is given by

$$v(x) = \frac{1 - x^T (X^T X)^{-1} x}{(x^T (X^T X)^{-1} x)^{1/2}}
       = \frac{1 - h}{h^{1/2}} \tag{5.3}$$

where $h = x^T (X^T X)^{-1} x$. Then

$$\eta\left(x_i; \frac{r_i}{s(i)}\right) =
  \begin{cases}
    \dfrac{r_i}{s(i)} & \text{if } |d_i| \le c \\[1ex]
    c\, v(x_i)\, \mathrm{sign}(d_i) & \text{if } |d_i| > c
  \end{cases} \tag{5.4}$$

where $s(i)$ is the standard deviation calculated without the ith case and is given as

$$(n - p - 1)\, s^2(i) = (n - p)\, s^2 - \frac{r_i^2}{1 - h_i}, \tag{5.5}$$

$h_i$ is the leverage of the ith case, $r_i$ is the residual of the ith case and $d_i$
is the corresponding DFFITS, which is defined as

$$d_i = \mathrm{DFFITS}_i = \frac{\hat y_i - \hat y_i(i)}{s(i)\, h_i^{1/2}}
      = \left[ \frac{h_i}{1 - h_i} \right]^{1/2} t_i \tag{5.6}$$

where $t_i$ is the externally studentized residual (see Belsley, Kuh and Welsch, 1980).
The DFFITS can be interpreted as the change in the fitted value that results when the
ith case is deleted, scaled by an estimate of the standard deviation of the fitted
value.
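The quantities above can be sketched as follows. This is an illustrative sketch, not the authors' program: it assumes the standard definition of the externally studentized residual, t_i = r_i / (s(i) (1 − h_i)^{1/2}), and it assumes that the downweighting takes the form w = min(1, c/|d_i|); the function names and the numerical inputs are hypothetical.

```python
import math


def deleted_scale(residual, leverage, s2, n, p):
    """s(i) from (n - p - 1) s^2(i) = (n - p) s^2 - r_i^2 / (1 - h_i)."""
    return math.sqrt(((n - p) * s2 - residual ** 2 / (1.0 - leverage)) / (n - p - 1))


def dffits(residual, leverage, s2, n, p):
    """d_i = sqrt(h_i / (1 - h_i)) * t_i, t_i the externally studentized residual."""
    s_i = deleted_scale(residual, leverage, s2, n, p)
    t_i = residual / (s_i * math.sqrt(1.0 - leverage))
    return math.sqrt(leverage / (1.0 - leverage)) * t_i


def welsch_weight(d, c):
    """Assumed weight: 1 if |d| <= c, otherwise c / |d|."""
    return 1.0 if abs(d) <= c else c / abs(d)


# Sample size and number of parameters of the example; cutoff 2 * sqrt(p / n).
n, p = 20, 7
c = 2.0 * math.sqrt(p / n)

# Hypothetical case: modest residual, very high leverage, residual variance 1.
d_high = dffits(residual=0.5, leverage=0.95, s2=1.0, n=n, p=p)
w_high = welsch_weight(d_high, c)

# Hypothetical case with a DFFITS value below the cutoff: full weight.
w_low = welsch_weight(0.5, c)
```

Even a modest residual yields a large DFFITS value when the leverage is close to one, so such a case receives a small weight in the fit.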

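The calculations are carried out using iterated least squares by making the
identification, using (2.7),

$$\eta\left(x; \frac{r}{\sigma}\right) = w(x; r)\, \frac{r}{\sigma} \tag{5.7}$$

Then

$$M = E\left\{ \psi_c'\!\left( \frac{r}{\sigma\, v(x)} \right) x x^T \right\} \tag{5.8}$$

where $\psi_c'(t) = I(|t| \le c)$ and

$$Q = E\left\{ w^2(x; r) \left( \frac{r}{\sigma} \right)^2 x x^T \right\} \tag{5.9}$$

Estimates are obtained by replacing the expectations with sample averages:

$$\hat M = n^{-1} \sum_{i=1}^{n} \psi_c'(d_i)\, x_i x_i^T \tag{5.10}$$

$$\hat Q = n^{-1} \sum_{i=1}^{n} w^2(x_i; r_i) \left( \frac{r_i}{\hat\sigma} \right)^2
          x_i x_i^T \tag{5.11}$$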


In the weighted least squares formulation using Welsch's weights, cases are
smoothly downweighted according to how much $|d_i|$ exceeds $c$. Following
the recommendation in Belsley, Kuh, and Welsch (1980), we took
$c = 2(p/n)^{1/2} = 1.183$. The value of the aligned generalized M-test statistic is
equal to 3.69 with an approximate p-value of .4492. Thus we fail to reject the null
hypothesis. On the other hand, the F and M tests strongly reject $H_0$ and yield
misleading results.
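The chi square p-values reported for the two robust statistics can be reproduced with a closed form that holds for even degrees of freedom. The sketch below assumes q = 4 degrees of freedom, corresponding to the four covariate coefficients being tested; the function name `chi2_sf_even_df` is hypothetical.

```python
import math


def chi2_sf_even_df(x, df):
    """P(chi2_df > x) for even df: exp(-x/2) * sum_{k < df/2} (x/2)^k / k!."""
    if df <= 0 or df % 2:
        raise ValueError("this closed form requires a positive even df")
    half = x / 2.0
    return math.exp(-half) * sum(half ** k / math.factorial(k)
                                 for k in range(df // 2))


# Statistic values taken from the example; q = 4 is assumed.
p_aligned = chi2_sf_even_df(3.69264, 4)
p_mtest = chi2_sf_even_df(13.86, 4)
```

With these inputs `p_aligned` agrees with the reported .4492 and `p_mtest` rounds to the reported .008, which is the practical point of the test: the reference distribution is an ordinary chi square rather than a weighted sum of chi squares.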

To estimate the scale parameter $\sigma$ for the calculation of the M-test we can
either use Huber's proposal 2 (Huber, 1981) or the estimator
$\hat\sigma = 2.1 \cdot \mathrm{med}\{|r^*|\}$, where $|r^*|$ are the $n - p + 1$ largest
absolute values of the residuals (Hill and Holland, 1977). In our calculation of the
M-test we used the Hill and Holland estimator of the scale.


6.  SUMMARY  AND  CONCLUSIONS 

The  effect  that  the  presence  of  a  combination  of  highly  influential  points, 
that  is  points  of  high  leverage  and  outliers  in  the  y  variable,  has  on  the  testing 
procedures  has  been  studied. 

In  the  least  squares  context,  aberrant  cases  can  determine  the  estimators  of 
the  true  parameter  vector.  They  confuse  the  results  of  the  testing  procedure  based 
on  the  least  squares  estimators,  since  the  test  reflects  the  contribution  of  those 
individual  points  in  the  model. 

The M-test, in this context, is unreliable, as is the F-test, though the
M-test behaves better than the F-test, with a p-value of .008 compared to
.0008 for the F-test. The better behavior of the M-test can be explained by the
fact that the test accommodates points with large residuals; it does not, however,
accommodate high leverage points.

The aligned generalized M-test shows the best behavior. Designed to
accommodate points with large residuals as well as high leverage points, it gives
reliable results in the presence of the above mentioned combination. Its influence
function is bounded. It also agrees with the τ-test, to which it is a practical
alternative, since one does not face the problem of computing p-values associated
with linear combinations of differentially weighted chi square random variables.

Thus, an elegant, computationally easy and reliable alternative to the class
of τ-tests is the class of aligned generalized M-tests.